Error correction for massive datasets

نویسندگان

چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Error correction for massive datasets

The paper is concerned with the problem of automatic detection and correction of errors into massive data sets. As customary, erroneous data records are detected by formulating a set of rules. Such rules are here encoded into linear inequalities. This allows to check the set of rules for inconsistencies and redundancies by using a polyhedral mathematics approach. Moreover, it allows to correct ...

متن کامل

Issues in preprocessing current datasets for grammatical error correction

In this report, we describe some of the issues encountered when preprocessing two of the largest datasets for Grammatical Error Correction (GEC); namely the public FCE corpus and NUCLE (along with associated CoNLL test sets). In particular, we show that it is not straightforward to convert character level annotations to token level annotations and that sentence segmentation is more complex when...

متن کامل

Spatial Prediction for Massive Datasets

Remotely sensed spatio-temporal datasets on the order of megabytes to terrabytes are becoming more common. For example, polar-orbiting satellites observe Earth from space, monitoring the Earth’s atmospheric, oceanic, and terrestrial processes, and generate massive amounts of environmental data. The current generation of satellites, such as the National Aeronautic and Space Administration’s (NAS...

متن کامل

Massive Datasets in Astronomy

Astronomy has a long history of acquiring, systematizing, and interpreting large quantities of data. Starting from the earliest sky atlases through the first major photographic sky surveys of the 20th century, this tradition is continuing today, and at an ever increasing rate. Like many other fields, astronomy has become a very data-rich science, driven by the advances in telescope, detector, a...

متن کامل

Typing Massive JSON Datasets

Cloud-specific languages are usually untyped, and no guarantees about the correctness of complex jobs can be statically obtained. Datasets too are usually untyped and no schema information is needed for their manipulation. In this paper we sketch a typing algorithm for JSON datasets. Our approach can be used to infer a succinct type from scratch for a collection of JSON objects, as well as to v...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Optimization Methods and Software

سال: 2005

ISSN: 1055-6788,1029-4937

DOI: 10.1080/10556780512331318281